3,778 research outputs found

    Efficient estimation of AUC in a sliding window

    Full text link
    In many applications, monitoring area under the ROC curve (AUC) in a sliding window over a data stream is a natural way of detecting changes in the system. The drawback is that computing AUC in a sliding window is expensive, especially if the window size is large and the data flow is significant. In this paper we propose a scheme for maintaining an approximate AUC in a sliding window of length kk. More specifically, we propose an algorithm that, given ϵ\epsilon, estimates AUC within ϵ/2\epsilon / 2, and can maintain this estimate in O((logk)/ϵ)O((\log k) / \epsilon) time, per update, as the window slides. This provides a speed-up over the exact computation of AUC, which requires O(k)O(k) time, per update. The speed-up becomes more significant as the size of the window increases. Our estimate is based on grouping the data points together, and using these groups to calculate AUC. The grouping is designed carefully such that (ii) the groups are small enough, so that the error stays small, (iiii) the number of groups is small, so that enumerating them is not expensive, and (iiiiii) the definition is flexible enough so that we can maintain the groups efficiently. Our experimental evaluation demonstrates that the average approximation error in practice is much smaller than the approximation guarantee ϵ/2\epsilon / 2, and that we can achieve significant speed-ups with only a modest sacrifice in accuracy

    A tool for subjective and interactive visual data exploration

    Get PDF
    We present SIDE, a tool for Subjective and Interactive Visual Data Exploration, which lets users explore high dimensional data via subjectively informative 2D data visualizations. Many existing visual analytics tools are either restricted to specific problems and domains or they aim to find visualizations that align with user’s belief about the data. In contrast, our generic tool computes data visualizations that are surprising given a user’s current understanding of the data. The user’s belief state is represented as a set of projection tiles. Hence, this user-awareness offers users an efficient way to interactively explore yet-unknown features of complex high dimensional datasets

    An optical fibre dynamic instrumented palpation sensor for the characterisation of biological tissue

    Get PDF
    AbstractThe diagnosis of prostate cancer using invasive techniques (such as biopsy and blood tests for prostate-specific antigen) and non-invasive techniques (such as digital rectal examination and trans-rectal ultrasonography) may be enhanced by using an additional dynamic instrumented palpation approach to prostate tissue classification. A dynamically actuated membrane sensor/actuator has been developed that incorporates an optical fibre Fabry–Pérot interferometer to record the displacement of the membrane when it is pressed on to different tissue samples. The membrane sensor was tested on a silicon elastomer prostate model with enlarged and stiffer material on one side to simulate early stage prostate cancer. The interferometer measurement was found to have high dynamic range and accuracy, with a minimum displacement resolution of ±0.4μm over a 721μm measurement range. The dynamic response of the membrane sensor when applied to different tissue types changed depending on the stiffness of the tissue being measured. This demonstrates the feasibility of an optically tracked dynamic palpation technique for classifying tissue type based on the dynamic response of the sensor/actuator

    Dear Wife : the Civil War letters of Chester K. Leach

    Get PDF
    Occasional paper (University of Vermont. Center for Research on Vermont) ; no. 20

    Bioinformatics tools in predictive ecology: Applications to fisheries

    Get PDF
    This article is made available throught the Brunel Open Access Publishing Fund - Copygith @ 2012 Tucker et al.There has been a huge effort in the advancement of analytical techniques for molecular biological data over the past decade. This has led to many novel algorithms that are specialized to deal with data associated with biological phenomena, such as gene expression and protein interactions. In contrast, ecological data analysis has remained focused to some degree on off-the-shelf statistical techniques though this is starting to change with the adoption of state-of-the-art methods, where few assumptions can be made about the data and a more explorative approach is required, for example, through the use of Bayesian networks. In this paper, some novel bioinformatics tools for microarray data are discussed along with their ‘crossover potential’ with an application to fisheries data. In particular, a focus is made on the development of models that identify functionally equivalent species in different fish communities with the aim of predicting functional collapse
    corecore